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Abstract. Oversampling is a common characteristic of data represent¬ 
ing dynamic networks. It introduces noise into representations of dy¬ 
namic networks, but there has been little work so far to compensate for 
it. Oversampling can affect the quality of many important algorithmic 
problems on dynamic networks, including link prediction. Link prediction 
seeks to predict edges that will be added to the network given previous 
snapshots. We show that not only does oversampling affect the quality 
of link prediction, but that we can use link prediction to recover from 
the effects of oversampling. We also introduce a novel generative model 
of noise in dynamic networks that represents oversampling. We demon¬ 
strate the results of our approach on both synthetic and real-world data. 


1 Introduction 

Networks have become an indispensable data abstraction that captures the na¬ 
ture of a diverse list of complex systems, such as online social interactions and 
protein interactions. All these systems are inherently dynamic and change over 
time. A common abstraction for incorporating time has been the “dynamic net¬ 
work,” a time series of graphs, each graph representing an aggregation of a dis¬ 
crete time interval of the observed interactions. While in many cases the system 
under observation naturally suggests the size of such a time interval, it is more 
often the case that the aggregation is arbitrary and is done for the convenience 
of the data representation and analysis. However, an abundance of literature 
has demonstrated that the choice of the time interval at which the network 
is aggregated has great implications on the structures observed and inferences 
made |7I24I14I21| . 

1.1 Oversampling 

We view the system through the filter of the data we collect. This data is typically 
collected opportunistically, with the temporal rate of data not always matching 
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that of the system. With the advent of microelectronic data collection systems 
such as GPS and RFID sensors, it is often the case that data is sampled at 
orders of magnitude more frequently than the temporal scale of the underlying 
system. Therefore, it is important that the aggregation process that transforms 
the collected data into a dynamic network representation correctly accounts for 
the oversampling effects. 

Oversampling is an aspect of the data collection process that can help with 
the issue of representing continuous time discretely. It helps reduce the number of 
missing interactions and allows us to better identify persistent interactions. On 
the other hand, oversampling affects our ability to distinguish between noisy local 
temporal orderings and critical temporal orderings. For example, when analyzing 
email communication networks, the data is collected at a resolution of seconds, 
but the causality of email interactions and the emergence of complex structures 
such as communities is often more accurately represented and detectable at much 
coarser scales. 

The concept of oversampling has been studied extensively in the signal pro¬ 
cessing community. However, to the best of our knowledge, there is no natural, 
well-defined notion of oversampling that translates directly to the domain of 
graph sequences. So here we use oversampling to mean a distribution over graph 
sequences that displays the typical effects of sampling too frequently: noisy local 
temporal orderings, spurious interactions, etc. 

In this paper, we assume we are given a noisy dynamic network that has 
been observed at a fixed oversampled rate. Our goal is to recover from this 
oversampling by aggregating the network together in such a way as to remove any 
artifactual temporal orderings, while preserving any real temporal information 
such as edge co-occurrences and critical temporal orderings. We represent a 
dynamic network as a sequence of discrete snapshots, that is, as a sequence of 
graphs. Given a window size w, we bin together into a graph all edges that occur 
within each length-w time span. This results in a new dynamic network. The 
goal becomes to find the window size w that best recovers from the noise. At 
one extreme, if this window size w is the entire length of the original dynamic 
network, then this means that there is no temporal information and the network 
is static. At the other extreme, if this window size w lasts for a single snapshot 
of the original dynamic network, then this means there is no oversampling at 
all and the network is observed at the right temporal scale. Thus, the window 
size can be seen as a proxy for the amount of temporal information stored in 
the dynamic network, and finding a good window size can be seen as finding the 
temporal scale at which the network is evolving. 

1.2 Link Prediction 

In this work, we take the approach that the inference task should inform how 
to recover from oversampling, whether the task is link prediction, community 
detection, or something else. Link prediction, in particular, is an important in¬ 
ference task with many applications. It has beeen used in the analysis of the 
internet [lEZ], social networks [2], and biological networks m. It has also 






been used for designing recommendation systems PITS] and classification sys¬ 
tems [T^. See the survey of A1 Hasan and Zaki [Tj for more applications and an 
introduction to the various techniques used in link prediction. 

Given the importance of link prediction, we investigate whether this task can 
serve as a good driver for recovering the correct temporal scale of an oversampled 
dynamic network. Our approach is simple: we use the quality of a link prediction 
algorithm as a score for the window size. If we can predict links better, it means 
that we have found a good window size, not only because by definition we have 
improved the performance of our inference task, but because at this scale we can 
better capture the evolution of the network. 

To our knowledge, we are the first to formally study the relationship be¬ 
tween temporal oversampling in networks and link prediction. As such, we need 
to introduce a model of noise for dynamic networks that captures effects of over- 
sampling. We define such a model by leveraging the following observation: when 
a dynamic network is oversampled, edge occurrences are recorded near the “nat¬ 
ural” occurrence time, but are spread out around that time. We present a model 
that generates noisy, oversampled dynamic networks by distributing the times 
of edges over a Gaussian distribution. This model can be extended to capture 
unique characteristics of a given data collection process. It also is applied inde¬ 
pendently of the underlying process generating the network, allowing for a wide 
variety of phenomena to be modeled. 

In this paper, we show that oversampling affects the quality of link prediction 
algorithms. Furthermore, using synthetic data, we show that link prediction 
performs better on graph sequences aggregated near the ground-truth window 
size than on other window sizes. In this light, we can recover from the effects 
of oversampling by using link prediction to find a good window size. Finally, we 
show that our method results in robust results on real-world networks. 

In Section we give a more formal description of the oversampling prob¬ 
lem for dynamic networks and present our link prediction based approach. We 
define a generative model for Gaussian-type noise representing oversampling in 
Section as well as a generative model for dynamic networks for testing link 
prediction algorithms. In Section we use our synthetic generative model to 
test our approach. We demonstrate that our method yields reasonable results 
and show the impact of a variety of different parameters on our results. We then 
show the results of our approach on two real-world networks in Section We 
end with a few concluding remarks in Section 

1.3 Related Work 

Extensive literature has demonstrated that the choice of aggregation window 
greatly impacts the quality of the corresponding dynamic network [7121124] . 
Some work has been done in developing heuristics for identifying the “right” win¬ 
dow size or temporal partitioning, especially for numerical time series |15I26| . In 
dynamic networks, though, most work does so only in limited or slightly different 
contexts. Peel and Glauset, for example, consider the problem of finding change 
points, points at which the generative process of a dynamic network is itself 








changing m- Sun et al. also considers the problem of finding change points - 
and more generally considers the problem of finding a partition of the network - 
this time in the context of community detection |25| . In [5] and [7] , they analyze 
the discrete Fourier transform of time series of different graph metrics to iden¬ 
tify important frequencies in a dynamic network. Similarly, Sulo et al. |24) use 
information-theoretic tools to analyze time series of a variety of different graph 
metrics for graph sequences that have been aggregated at different temporal 
scales. 

Prediction as a tool to inform model selection is not new, especially in the 
literature of time series analysis. Central in this literature is the principle of 
minimum description length, which says that if data displays any regularity (i.e. 
is predictable), then that regularity can be used to shorten an encoding of the 
data. Therefore finding a short encoding is finding a good predictor, and vice 
versa. See m for a survey on the subject. In the context of temporal networks, 
this approach is not as common, but link prediction has been used to infer useful 
static networks from data [5]. 

There are a number of models in the literature for dynamic networks, includ¬ 
ing the GHRG model of Clauset and Peel [H] , the activity-driven model of Perra 
et al. [ini, and the dynamic latent-space model of Sarkar and Moore [52]. Some 
models for static networks can also naturally be seen as a model for dynamic 
networks in which the number of nodes is growing over time, such as in the 
preferential attachment model. However, none of these models are well-suited 
for modeling oversampling or for controlling the quality of a link prediction al¬ 
gorithm as we will need, so we use a novel model, which we detail in Section 


2 Problem Formulation and Methodology 

To formalize the problem, we assume that a noisy dynamic network is a sequence 
of discrete graphs (each graph representing one time step) on a fixed set of n 
nodes. Furthermore, this graph sequence is the output of some noise process 
being applied to a (possibly shorter) sequence of graphs - the ‘ground truth’ 
- on the same set of n nodes. Specifically, we assume that the noisy sequence 
can be partitioned into ‘windows’ of fixed size, and that any temporal orderings 
between edges in a single window are spurious. Thus recovery consists of the 
two step process of first choosing a window size w (which in this paper we will 
also refer to as a temporal scale of the graph sequence) and then aggregating 
all edges within each w consecutive graphs into a single graph. This results in a 
new graph sequence that is a factor of w shorter. Figure gives an illustration 
of this process. 

In general, the goal is to recover the ground truth from the noisy sequence. 
Of course, this is not always possible. In the degenerate case, the ground-truth 
sequence is just noise and this task is impossible. In this paper, we assume 
the ground-truth sequence is sufficiently non-noisy, so that a link prediction 
algorithm will perform well on it. Link prediction is the classification task of 
finding those pairs of vertices (where there is not already an edge) which are 
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Fig. 1: Example of aggregating a time series of graphs into a coarser window 
w = 2. 


most likely to form an edge in the next time step. Any similarity score on pairs 
of vertices can naturally be considered a link prediction algorithm: two vertices 
with a high similarity score are assumed to be more likely to have a link in the 
future. Liben-Nowell and Kleinberg m give a detailed list of such similarity 
scores. We will also need similarity scores to create ground-truth synthetic data, 
as detailed in Section]^ Let score{a;,y) denote the similarity score between two 
vertices and r{x) the neighborhood of x. With this notation, we use the following 
four scores, as given in m 

1 . Adamic-Adar: score(a:,y) := T,,er{x)nr(y) ioe\r{z)\ 

2. Katz^: score{x,y) := ’ |{paths of exactly length £ from x to y}\ 

3. Graph distance: score(a;, y) := —d{x, y), where d{x, y) is the distance between 
X and y. 

4. Rooted PageRank^: Consider a random walk on the graph that resets to 
vertex u with probability a and moves to a random adjacent vertex with 
probability 1 — a. Then score(a::, y) is the stationary probability of y when 
resetting to x plus the stationary probability of x when resetting to y. 

To test the quality of a window size w, we analyze the performance of a link 
prediction algorithm on the graph sequence aggregated at w. The link prediction 
algorithm gets as input the graph at time t and predicts new edges in the graph 
at time t-l-1. We predict the edges with the top k scores, where k is the number 
of edges that actually are created in the next time step in the sequence. As is 
common, we view this as a binary classification task, where pairs of vertices are 
either in the category of new edges or not. When viewed in this way, it is natural 
to score the algorithm as the correlation between the algorithm’s predicted edges 
and those edges that actually appear, which is called the Matthews correlation 
coefficienl[^MCC) [S]. Given a pair of consecutive graphs Gi and Gi+i from 
a graph sequence aggregated at a window size w, the Matthews correlation is 

^ The MCC measure is used, rather than accuracy or precision, because MCC skirts 
the issue of bias that accuracy and precision have: the number of edges appearing is 
often a very small fraction of the total number of possible edges. We use MCC over 








the [—1, l]-valued Pearson correlation coefficient for classification defined as a 
normed statistic between the predicted edges to appear in (using Gi as 
the input to the link prediction algorithm) and the actual new edges appearing 
in Gi+i- The score assigned to w is the average of these correlations over all 
pairs of consecutive graphs in the aggregated sequence. 

We test all window size^ This algorithm is summarized in Algorithm 


Algorithm 1 Assign scores to each window size 
for w = 1 to [n/3j do 

Let g' be g aggregated at size w, so g' = G'l,..., . 

for i = 0 to \g'\ — 1 do 

new links = A(G'+i) \ A(G') 
predicted links = LP{G'i,G'i^^) 
scorei = MGG{new links, predicted links) 
end for 

—n scorei 

scoret„ = - 

end for 

return all pairs w, scores 


Since this is a computationally-expensive task, for sufhciently long sequences, 
a random ten percent of the consecutive pairs of aggregated graphs are tested 
instead of all of the pairs. Based on our empirical analysis, considering only a 
subsample of the windows does not significantly affect the scores. 

Depending on the application, it may be desirable to use more than one 
different window size - more than one time scale may be interesting or perform 
highly. In addition, for a given time scale, there may be a range of window sizes 
centering around that time scale that is of interest. Moreover, we leave for future 
work determining how significant differences in quality scores are, given that we 
make no assumptions on how many different time scales are of interest for the 
application. In this light we do not return just the top-quality window size, as 
seen in Algorithm]^ 


other measures also resistant to unequally sized categories, such as AUC, because 
MCC is computationally very fast. In addition, it seems to emphasize differences 
in scores better than other measures for our link prediction task. Regardless, other 
measures give very similar scores as AUC - different measures seem to preserve the 
order of the qualities of the window sizes. 

To be more precise, we only test window sizes up to a third of the length of the input. 
We assume that if the actual window size is any bigger, then there is no temporal 
information that we can utilize and the underlying network is really a static network. 







3 Generative Models for Graph Sequences 


To create synthetic data, we will use two generative models, one for the ground- 
truth sequence representing the noiseless dynamic network, and the other, a 
noise model that takes as input a ground-truth sequence (which we sometimes 
refer to as the underlying sequence) and outputs a noisy oversampled sequence. 


3.1 Generative Model for the Ground-truth Graph Sequence 

The ground-truth graph sequence can be formed from any existing model of a 
dynamic network (such as the latent-space model or the activity-driven model 
mentioned in the introduction), but to test the performance of a link-prediction- 
based approach, we instead use a novel and simple generative process that allows 
us to test our approach, which has the advantage that the quality of the link 
prediction algorithm is a parameter of the generative process. This generative 
process starts with an initial graph G and adds a fixed but parameterized number 
of edges i5 for every subsequent graph. The edges added are the non-edges with 
the top scores as rated by a given similarity score. This model can easily be 
extended as needed, for example by deleting the edges with the lowest similarity 
score every time step as well, or still further to a probabilistic edge creation and 
deletion process. 

For the initial graph G, in this paper we consider both the Erdos-Renyi 
model G{n,p) [TU] and the preferential attachment model BA{n,m) [B]. For the 
similarity scores, we use Adamic-Adar and Katz. The use of different similarity 
scores allows us to test our approach both when the quality of the link prediction 
algorithm does well and when it does not do well. For example, if we use Adamic- 
Adar to both create the graph sequence and to do link prediction (assuming 
no noise) the link prediction algorithm will perform perfectly. However if the 
sequence is instead made with the Katz similarity score, the link prediction 
algorithm will not perform as well. Since we can’t guarantee the quality of the 
link prediction algorithm on non-synthetic data, this model allows us to see how 
link prediction performance affects our approach. 


3.2 Generative Model for Oversampling 

We now need to model the sampling process by which non-synthetic data would 
be gathered. Our primary approach to modeling noise, specifically oversampling, 
is to assume that for a given time step, the edges that occur are measured to 
be near that time step, but not necessarily at that time step. Furthermore, 
we assume the distribution of these edges in time is Gaussian. Given an input 
graph sequence of length t and parameters /r € N, S K>oi this model outputs 
a graph sequence of length approximately p ■ t that represents the sequence being 
oversampled at a constant rate /r, with controlling how concentrated the edges 
occur around the “true” times. Specifically, given these two parameters p and cr, 
for an edge that occurs in the ground-truth graph at time step i, the edge will 


occur in the new noisy graph sequence at time step j ^ where j is 

rounded to the nearest integer. 

If a is sufficiently small, then there is likely to be intermediate graphs where 
there are no edges. If windows start and end within these gaps, we can recover 
fully from this noisy process. However, as a gets larger, it becomes more and more 
difficult, as more and more edges from distinct graphs in the original network 
start getting added to the same graphs in the new noisy network. In the limit, all 
temporal information is destroyed. Figure|^gives two examples of the number of 
edges in each time step for two different settings of parameters. It’s worth noting 
that the oversampling noise model presented here can be extended so that the 
oversampling rate is non-constant, the distribution used is non-Gaussian, etc. 


4 Results for Synthetic Data 

In this section, we analyze a variety of oversampled, dynamic networks generated 
by models detailed in Section and aggregated at different window sizes. We 
investigate the effect of window size on the performance of fink prediction. We 
show that link prediction performs better on sequences aggregated at window 
sizes close to ground-truth than on other window sizes. Furthermore, this holds 
when the ground-truth sequence uses a different similarity score than the one 
used to perform link prediction. We also show that larger values of /r, while fixing 
<7 (increasing the separation between means), smaller values of cr, while fixing 
/r (higher degree of concentration around each mean), larger values of S (more 
edges appearing each time step), and higher quality of fink prediction all make 
the task of recovering the ground-truth window size easier. Yet even at higher 
levels of noise we show that this approach recovers reasonable results and gives 
some evidence that it outperforms the simpler approaches that rely solely on 
extracted time series information, such as the number of edges per time step. 



Fig. 2: MCC scores as a function of window size for three different noisy se¬ 
quences. Parameters used were fi = 100, S = 50, and from shortest to longest, 
a = 8, 20,40. The underlying graph sequence is generated by starting graph 
G{n,p) and Adamic-Adar as the similarity score. 






For the sake of brevity, given the number of parameters, including /r, cr, <5, 
the number of link prediction algorithms both for creating the sequence and for 
finding the quality scores, we do not show results for all possible combinations, 
but instead show a representative sample. 

In the remainder of this section, we fix the number of vertices n to be 250 
and fix the edge probability p for the Erdds-Renyi model at 0.05. Figurej^shows 
the drastic improvement of link prediction performance at larger windows of 
aggregation. For the sequence generated using cr = 8, as shown in this figure, 
performance at window size of 1 (no aggregation) is essentially random (average 
MCC « —10“^^), but when aggregated at a window size of 95, performance is 
perfect (average MCC = 1.0). Note that the best performing window size is 
very close to the mean separation p, = 100. When the standard deviation is 
comparatively higher, as when cr = 40, Figure shows that windowing has a 
comparatively smaller effect. Here the link prediction is not able to separate 
consecutive graphs and therefore recommends to aggregate all graphs in the 
sequence together. 




(a) p = 100, 5 = 50, a = 8 (b) ir = 100, <5 = 50, cr = 40 

Fig. 3: Number of edges in two noisy sequences over time with the given param¬ 
eters. The underlying sequences were created with G(n,p) as the starting graph 
and Adamic-Adar as the similarity score. The vertical lines indicate the borders 
of windows when the window size is 95. 


Figurej^gives further evidence for why this is the case. When cr = 40, mixing 
(of edges) between graphs is much higher than in the case when cr = 8. In this 
latter case, there is in fact no mixing at all, which is why there are many window 
sizes where link prediction performs perfectly. Since link prediction here is so 
good - it will be perfect in the absence of noise - it represents an easier case. 
However, this case still demonstrates a convenient benefit to our approach: we 
can still perform well even when the target windowing is not uniform over the 
length of the input noisy sequence. That is, the noisy sequence may have shifted 
windows and as a result, one window could be smaller than the others. In our 
































case, window size 95 performs better than a window size 100, indicating we can 
still find a uniform window size that performs well, even if it is not the same 
window size as the mean separation window. 




(a) ^ = 100, S = 50, and from shortest (b) /i = 20, S — 50, and cr = 2,4,8, 20 
to longest, O' = 8, 20, and 40 




(c) p = 20, O' = 8, and, from top to (d) = 100, 5 = 5, and, from shortest 
bottom, S = 100, 50, and 5 to longest, o' = 8, 20, and 40 


Fig. 4: MCC scores as a function of window size for different noisy sequences 
where the Katz similarity score was used to generate the underlying sequence. 
In Figure 4d the starting graph is the preferential attachment graph BA{n, 5) 
instead of G{n,p). 


Figure [^illustrates a similar behavior when the ground truth sequence was 
instead created with the Katz similarity score (testing of each window size is 
still done using the Adamic-Adar score as the link prediction algorithm). While 
overall quality scores are lower, as the link prediction algorithm itself is worse, 
the same pattern holds: near a window size of 100, the link prediction algorithm 
does significantly better. 


Our approach is more resilient to a higher value of u, as seen in Figure 4b 


However, even when edge mixing is very high {a = 20 and p = 20), link predic- 









tion is still helpful in identifying a good window size (near /r = 20). This gives 
significant evidence that our approach outperforms simpler approaches that rely 
solely on extracted time series information such as the number of edges per time 
step. Finally, link prediction does better at recovering a good window size when 6 
is higher - it retains its resiliency even for very sparse cases, as seen in Figure]^ 

As mentioned above, our generative model easily extends to more general 
models. Again, for the sake of brevity, we will not discuss the performance of 
our approach on every possible variation of our model, but we do want to note 
that our results do extend. For example. Figure [4d| shows our results when the 
following two changes have been made: The initial graph is instead of G{n,p) an 
instance of the preferential attachment model. In addition, instead of just adding 
edges, edges are deleted as well. Namely, (5 existing edges that have the lowest 
Katz score are deleted. These changes show very little impact on our results, as 
seen by comparing Figures and [4a| 

5 Results for Real-world Data 

The validation process on real-world data is difficult, in general, for inference 
tasks on networks, but especially for the problem of temporal scale identifica¬ 
tion. A significant barrier is the lack of ground truth and/or formal notions of 
what should be considered a good temporal scale. Real-world dynamic networks 
often exhibit multiple critical temporal scales corresponding to the evolution 
of different important features (e.g. communities) and processes (e.g random 
walks) [24121| . but the relationship between these important features and pro¬ 
cesses and their corresponding temporal scales is not well understood. 

Our expectation is that there can be multiple peaks in the quality of a link 
prediction algorithm as window size increases, each corresponding to different 
important temporal scales. We validate our results by making sure that different 
link prediction algorithms behave similarly as a function of window size, despite 
following different mechanisms for scoring future edges. When two different link 
prediction algorithms show peaks at the same time scale despite not necessar¬ 
ily predicting the same edges, this gives evidence that the peaks are inherit 
to the sequence and not a function of the particular algorithm. We now show 
how our approach performs on real-world data sets, namely the Haggle Infocom 
network [53] and the MIT Reality Mining network jSj. 

5.1 Haggle Infocom 

The Haggle Infocom dataset is the result of 41 users equipped with Bluetooth 
phones at the Infocom 2005 conference, over the course of four days. There is 
an undirected edge between two users at time t if they were in proximity at 
that time. The data we used was initially binned at 10 minute time intervals. 
The results are shown in Figure The four link prediction algorithms behave 
consistently with clear peaks at approximately in = 75 (« 12.5 hours), w = 110 
(« 18 hours), and w = 130 (« 22 hours). The interactions present in the Haggle 




network have a periodic nature imposed by the regular conference structure and 
our algorithm seems to identify such periodicities, in particular the half and 
one-day periodicities. 



(a) Adamic-Adar (b) Katz^ (/? = 0.005) 



(c) Rooted PageRank^ (a = 0.15) (d) Graph distance 


Fig. 5: MCC scores for four link prediction algorithms as a function of window 
size for the Haggle sequence. 


5.2 MIT Reality Mining 

The MIT Reality Mining dataset consists of 90 grad students and professors’ 
data from their cell phones in the 2004-2005 academic year. Timestamps were 
kept for three types of data: Bluetooth proximity, cell tower proximity and phone 
call communications. This naturally yields three different dynamic networks that 
we extracted from the raw data. The first network has an undirected edge at 
time t whenever Bluetooth recorded two cell phones as close at time t, the second 
has an undirected edge between two participants at time t if they were recorded 
near the same cell tower, and finally the third has an undirected edge between 
two participants at time t whenever one participant called the other. We will 
refer to these as the Bluetooth sequence, the cell tower sequence, and the call 
sequence respectively. Each network is initially windowed at a size of one day. 









The results are shown in Figure Of interest is that the three different 
networks perform - in terms of the quality of the link prediction - consistently 
differently from each other, implying that they really are different types of net¬ 
works. The Bluetooth sequence shows clear signs of oversampling; a window size 
of w = 1 has smaller quality than larger window sizes. The performance of link 
prediction on this sequence stabilizes after w = lA (roughly 2 weeks), while in 
the case of the cell tower sequence, the performance drops and then prominently 
improves at w = 65 (roughly 2 months). Considering the weekly, monthly, and 
semester structure of academic activities, these windows of aggregations appear 
reasonable in capturing the underlying dynamics of the Reality Mining networks. 



(a) Adamic-Adar (b) Katz^ (/? = 0.005) 



(c) Rooted PageRank^ (a = 0.15) (d) Graph distance 


Fig. 6: MCC scores for four link prediction algorithms as a function of window 
size for the Reality Mining sequence. At the far left of each plot, from top to 
bottom is the cell tower sequence, the Bluetooth sequence, and the call sequence 
respectively. 


These results are given weight by the agreement between three link predic¬ 
tion algorithms, Adamic-Adar, Katz, and Rooted PageRank, as seen in Figurej^ 
The exception is the graph distance algorithm, whose performance is significantly 
worse than the other three algorithms, ultimately making it difficult to discern 
the quality of different window sizes. The graph distance algorithm has been 







studied in the literature before and is identified as a very low performing link 
prediction algorithm m- Figure shows the relative performance of the four 
link prediction algorithms for the Haggle and Reality Mining datasets and re¬ 
emphasizes the conclusions in m In picking the graph distance algorithm for 
our problem, we wanted to investigate whether the performance of a bad predic¬ 
tion algorithm can be substantially improved by a better window of aggregation. 
As the analysis of the Haggle and Reality Mining datasets shows, while some 
loss in the quality of prediction can be overcome (as in the case of the Haggle 
sequence), sufficiently bad prediction cannot (as in the case of Reality Mining 
sequence), as should be expected. 




(a) Haggle sequence (b) Reality Mining cell tower sequence 


Fig. 7: Average quality of the link prediction algorithms Rooted PageRank, Katz, 
and graph distance, from top to bottom (as seen at furthest to the right) in 
each plot. Here the quality of the link prediction is measured as the average 
resemblance to the actual links that appear, where the resemblance of a set 
of predicted links to actual set is the size of their intersection divided by the 
size of their union. The scores are then normalized so that the resemblance of 
Adamic-Adar is 0.0 (the dotted line). 


6 Conclusions 

In this work, we treat link prediction as a signal that helps us understand the 
temporal dynamics of a network. Making the connection between the ability to 
predict new edges and the appropriate temporal scale that captures the evolution 
of the network, we present a novel, task-driven algorithm for de-noising oversam¬ 
pled dynamic network into more robust representations. We formally define a 
model of oversampling in dynamic networks that captures general properties of 
the noise it induces and allows for a more rigorous analysis of our algorithm. 
Across a variety of synthetic instances of noisy, oversampled dynamic networks. 







and two real dynamic networks, we show that the performance of a link predic¬ 
tion algorithm can serve as a good quality score for identifying the appropriate 
window of aggregation. 

Our work opens up several potential avenues for further investigation. As 
mentioned above, we do not discuss in this paper how to select which windows 
are outliers in terms of link prediction quality. We would also be interested in 
extending the framework presented here to non-uniform partitions of the timeline 
of the network, instead of an inform window of aggregation. 

We only investigate link prediction but other tasks, such as community de¬ 
tection or other inference tasks, may be used to drive the choice of window size. 
One direction for further work is to investigate how the choice of inference task 
affects the choice of window size and to what degree do other inference tasks 
agree with link prediction. 

Finally, we leave issues of scalability - both in terms of properties of link 
prediction as the size of the graphs gets very large and in terms of how to make 
our algorithmic approach suitably fast for such large graphs - for future work. 
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